Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm
Authors
Abstract
While many approaches exist in the literature to learn low-dimensional representations for data collections in multiple modalities, the generalizability of multi-modal nonlinear embeddings to previously unseen data is a rather overlooked subject. In this work, we first present a theoretical analysis of learning multi-modal nonlinear embeddings in a supervised setting. Our performance bounds indicate that for successful generalization in multi-modal classification and retrieval problems, the regularity of the interpolation functions extending the embedding to the whole data space is as important as the between-class separation and cross-modal alignment criteria. We then propose a multi-modal nonlinear representation learning algorithm that is motivated by these theoretical findings, where the embeddings of the training samples are optimized jointly with the Lipschitz regularity of the interpolators. Experimental comparison to recent multi-modal and single-modal learning algorithms suggests that the proposed method yields promising performance in multi-modal image classification and cross-modal image-text retrieval applications.
Similar Resources
Multi-Modal Bayesian Embeddings for Learning Social Knowledge Graphs
We study the extent to which online social networks can be connected to knowledge bases. The problem is referred to as learning social knowledge graphs. We propose a multi-modal Bayesian embedding model, GenVector, to learn latent topics that generate word embeddings and network embeddings simultaneously. GenVector leverages large-scale unlabeled data with embeddings and represents data of two ...
An Algorithm for Multi-Realization of Nonlinear MIMO Systems
This paper presents a theoretical approach to implementation of the “Multi-realization of nonlinear MIMO systems”. This method aims to find state-variable realization for a set of systems, sharing as many parameters as possible. In this paper a special nonlinear multi-realization problem, namely the multi-realization of feedback linearizable nonlinear systems is considered and an algorithm for ...
Learning in an Inclusive Multi-Modal Environment
The findings arising from a case study on improving interaction design for teaching visually impaired students, in an inclusive learning environment, are presented. The original case study identified that a major problem for those with visual impairment when learning computer science (and probably any science or engineering discipline) is the need to draw and appreciate diagrams. The cognitive ...
Learning Multi-modal Similarity
In many applications involving multi-media data, the definition of similarity between items is integral to several key tasks, e.g., nearest-neighbor retrieval, classification, and recommendation. Data in such regimes typically exhibits multiple modalities, such as acoustic and visual content of video. Integrating such heterogeneous data to form a holistic similarity space is therefore a key cha...
Journal
Journal title: IEEE Transactions on Image Processing
Year: 2021
ISSN: 1057-7149, 1941-0042
DOI: https://doi.org/10.1109/tip.2021.3071688